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Thermodynamic cost of reversible computing 

Lev B. Levitin and Tommaso Toffoli 



Since reversible computing requires preservation of all information throughout the entire computational 
process, this implies that all errors that appear as a result of the interaction of the information-carrying 
system with uncontrolled degrees of freedom must be corrected. But this can only be done at the expense 
of an increase in the entropy of the environment corresponding to the dissipation, in the form of heat, of the 
"noisy" part of the system's energy. 

This paper gives an expression of that energy in terms of the effective noise temperature, and analyzes the 
relationship between the energy dissipation rate and the rate of computation. Finally, a generalized Clausius 
principle based on the concept of effective temperature is presented. 
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1 Cost of computing in the pres- 
ence of noise 

The concept of reversible computing was introduced in 
[TJ [TTJ [3] with the idea to get rid of the immense energy 
dissipation and heat generation caused by the irreversibil- 
ity of conventional computing processes. In general, logical 
reversibility in computation is a necessary but not sufficient 
condition for physical reversibility. However, in quantum 
computing these two characteristics come together: any vi- 
olation of physical reversibility of the evolution of the con- 
trolled information-carrying degrees of freedom represents 
noise that, if left unchecked, will destroy the computation. 
Thus, it is natural to consider the problem of reversible 
computing in a quantum milieu. 

Reversible computing is by its nature closer to commu- 
nication than to conventional computing, and therefore it 
calls for being analyzed from the information-theoretical 
standpoint. Indeed, if it is true that "communication is 
computation of the identity function," then reversible com- 
putation is computation of a bijective function: there is a 
one-to-one correspondence between the set of all possible 
input data and that of all possible results of the specified 
computation. Paradoxically, not only does this requirement 
make the physical implementation of reversible computa- 
tion more difficult, but it also creates a specific cause of 
energy dissipation and increase of the entropy of the envi- 
ronment. 

Indeed, while at every stage of a conventional computing 
process we are only concerned with the integrity of that part 
of the information that is going to contribute to the final 
result (usually, a small fraction of the information presented 
in the input data), the precondition of reversible comput- 
ing is preservation of all information included in the choice 
of the initial conditions for any particular instance of com- 
putation throughout the the entire process of computing. 
However, this goal cannot be achieved for free. The state of 
the information-carrying system (the controlled degrees of 
freedom) is subject to noise owing to two factors: the statis- 
tical nature of the interaction between the system and the 
external devices implementing the required transformation 
of the system state, and the interactions with uncontrolled 
degrees of freedom (the environment). As a result, errors 
appear in the state of the system, which requires correc- 
tions that inevitably lead to energy dissipation, i.e., to an 
increase of the entropy of the environment. 



Henceforth we call 'system' the collection of controlled 
degrees of freedom involved in the process of computation, 
rather than the entire physical object. 

Suppose that we start with an ensemble {gf ,p{\ of or- 
thogonal, pure initial states, described by density matrices 

that occur with probabilities pi. Then the initial infor- 
mation Iq (i.e., the information in the ensemble of the cho- 
sen states gl about the ensemble of the initial data labeled 
by i) is given by 



To = - \npi = -Trg In g°, 



where g° = ^2 i Pidl is the a priori density matrix of the en- 
semble. The entropy of each initial state is zero. After per- 
forming a step of the computational process (such as, e.g., 
application of a "quantum gate"), we obtain an ensemble 
{gi,Pi}, where the density matrices Qi are in general neither 
pure nor orthogonal. Note that, in general, matrices gi may 
describe states of a different physical system than the states 
characterized by gf. What is important, however, is that 
there exists a one-to-one correspondence between the sets 
{e°i} and {gt}. _ 

The average entropy H of the state after the computation 
step is 



H 



PiTrgilngi > 0. 



Now the information I conveyed by the state ensemble 
about the initial state is 



I < H - H < In 



(1) 



where 



H = — Trgln g, with g = y^pjgj. 



Inequality (p} follows from the entropy defect principle [4]. 

In order to achieve reversibility, the initial ensemble of 
states should possess sufficient redundancy to turn inequal- 
ity |T]) into an equality, or, in other words, to make 1 = 1$. 
Then the output state Qi uniquely determines the corre- 
sponding input state The reversibility condition allows 
us to consider a "reverse channel" with ensembles {gi,Pi} as 
input and {g®,Pi} as output. In this channel, the quantity 
H plays the role of equivocatianQI)]. It expresses the ef- 
fect of noise and the presence of errors in the results of the 
computation. According to Shannon's 10 th theorem (the 
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"correction channel theorem" [TD] ) , H is equal to the mini- 
mum additional information required in order to correct all 
the errors in the state of the system and thereby preserve 
the initial amount of information (though, in general, in a 
transformed form). In other words, to correct the errors, 
the entropy of the information-carrying system must be de- 
creased by the value of H . Of course, this can be done 
only at the price of an entropy increase of at least H in 
other degrees of freedom (e.g., the environment). The ac- 
tual amount of energy that has been dissipated within the 
system depends on the concrete properties of the system 
itself, as shown by the following considerations (cf. [5]): 

Denote by E(T') the energy of the system in the state 
of thermal equilibrium at temperature T' . The increase of 
the average entropy H means that part of the energy has 
been transformed into heat in the system or transferred to 
the system in the form of heat. How large is this part? It 
is equal to the energy E{T) that the system would have 
had in the state of thermal equilibrium at temperature T, 
where T is determined by the value H of entropy increase 
according to the relation 



H = H(T)= f^^V; 
V ' J kT' dT' 



(2) 



thus, T has the meaning of an effective noise temperature. 

The concept of effective noise temperature was first intro- 
duced in [5] . Being applicable to any non-equilibrium state 
of a system, the effective temperature has, nevertheless, the 
same fundamental properties as the usual temperature in 
the thermodynamics of equilibrium systems (and coincides 
with latter at equilibrium). In particular, it can be shown 
that if two non-equilibrium physical systems have effective 
temperatures Ti and T 2 , where T\ < T 2 , and corresponding 
"thermal" energies Ei(Ti) and E 2 (T 2 ) as denned by ©, 
it is impossible to transfer an amount of energy AE from 
the first system to the second by decreasing E\(Ti) by an 
amount AE and correspondingly increasing EziT^) by the 
same or a smaller amount as a sole result of a physical pro- 
cess. In fact, this property represents a generalized Clausius 
principle^. One is thus justified in regarding E(T) as an 
amount of energy converted into heat. 

In the absence of noise (H = 0), after one step the system 
will find itself in the pure state corresponding to the initial 
state q®. If H 7^ 0, this implies a nonzero probability of 
being in a state different from the correct one — the proba- 
bility of error. Correspondingly, in the calculation of E(T), 
if H — 0, then T = and E{T) = 0, as if the system were 
in its ground state. If H ^ 0, then T ^ and E{T) ^ 0. 
E(T) can be expressed alternatively as a function of the 
entropy H , or of the probability of being in a non-ground 
state — which we interpret below as the probability of error. 

It follows from the above considerations that the proce- 
dure of error correction entails removing an amount of heat 
E(T) from the system and applying to it an amount of work 
required to restore the correct state. 

Even though it arises from disturbances introduced into a 
system by interactions with its environment, the noise tem- 
perature is not equal, in general, to the temperature of the 
environment itself (were the latter at thermal equilibrium) . 



Note that the minimum amount of heat Q c to be ulti- 
mately transfered at every step to an environment at tem- 
perature T e (an "infinite, constant-temperature heat sink" ) 
to ensure continuous, closed-cycle operation of the invert- 
ible computing machinery envisaged here is not equal, in 
general, to E(T). In fact, in a reversible process, Q c = HT C . 
Indeed, if T c < E(T)/H we could even produce some "use- 
ful work" by removing heat E(T) from our system. How- 
ever, in a real situation, usually T c > E{T)/H (we cannot 
allow the computing system to "heat up" too much since 
too large a probability of error would make the state of the 
system incorrectible) and thus Q c > E(T). 

One could reason that the amount of dissipated energy 
can be made arbitrarily small if one works with "low en- 
ergy" states. But it should be borne in mind that the lower 
the energy, the larger the time taken by each computational 
step. As shown in El [6], the minimum time for trans- 
forming a state to an orthogonal state ( "flipping a qubit" ) 
is 

h 

T ~4E> 

where h is Plank's constant and E the average energy of 
the system. This inequality turns into an equality only for 
a system with two orthogonal states. For a sequence of N 
mutually orthogonal states a stronger inequality, 



t > 



N - 1 h 
N 2~E' 



becomes valid[S]. Thus, the maximum rate of computation, 
i.e., the number R of computational steps per unit time is 
proportional to the quantum-mechanical average energy of 
the system, that is, 

4E 

R=^- (3) 



for two orthogonal states and 



R = 



N 2E 
N-l~h 



(4) 



for a sequence of ./V such states. 

As shown below, it follows from |(2}, ([3]), and (j4|) that, 
the faster one wants to perform a computation, the more 
energy per step will be dissipated in the form of heat. The 
energy dissipation per computational step is then expressed 
as an increasing function of both the noise temperature and 
the rate of computation (i.e., the number of computational 
steps per unit time). 

2 Examples 

We shall consider first two representative examples. 

Example 1. The qubit. 

Let a system with two orthogonal states (a two- 
dimensional Hilbert space) have two energy levels, Eq = 
and Ei. The maximum rate of computation is achieved 



for two pure states 4>i 



U\E ) + |£d)) and 



V2 



TTjd-E'o) — \E>i))> with quantum- mechanical average energy 
E = E\/ 2 (cf. [5]), which turn into one another at each 
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computational step. Their density matrices in the station- 
ary basis {\E }, \Ei}} are 



In this case 



1/2 1/2' 
1/2 1/2 



R 



and Q® 
- I 2£i 



1/2 -1/2 
-1/2 1/2 



(5) 



ft ft 

Suppose now that there is a probability e of error, so that 
the resulting states are each a mixture of the correct state 
with probability 1 — £ and the other state with probability 
£, with density matrices 



Qi = 



1/2 1/2 -£ 
1/2 -£ 1/2 



<?2 = 



1/2 
-1/2 + £ 



-1/2 + £ 
1/2 



The average entropy of the states is 

H = -sine- (1 -e)ln(l - s) 



(6) 



and is equal to the entropy of a thermal equilibrium state 
with temperature T described by the density matrix 



Peq = 



1 -£ 
£ 



where e = 



-Ei/kT 



1 + e - E i/ kT ' 



(7) 



The states in ([6} and ([7|) are thermodynamically equiv- 
alent, since they have the same entropy H. In particular, 
they can be represented by mixtures of a pure state and 
the maximum-entropy state [ ^ 2 ] with the same coeffi- 
cients, respectively, 1 — 2e and 2e: 



Qt = (1 - 2e) 
gcq = (1 - 2e) 



1/2 ±1/2' 

±1/2 1/2 

1 0' 




2£ 



1/2 ' 
1/2 



1,2: 



2e 



1/2 ' 
1/2 



Thus, T in {7J is the effective noise temperature as defined 
by ©. 

The energy of the system at the thermal equilibrium state 
with temperature T is 



E(T) = 

Hence, from (|S} and ©, 

E(T) 
and thus, by (O, 



-Ei/kT 



E(T) 



1 + e - El /kT^- 

e -hR/2kT hR 
l + e -hR/2kT 2 ' 

fti? 



(8) 



-£. 



The quantity E(T) represents the lower bound on en- 
ergy dissipation per step of computation — in other words, 
per "flipping of the qubit." Thus, for a qubit, assuming a 
fixed probability of error, the minimum energy dissipation 
is proportional to the rate of computation. Consequently, 
the amount of heat Q generated per unit of time is propor- 
tional to the square of the rate of computation: 



Q 



hR 2 



(9) 



Example 2. Quantum harmonic oscillator. 

A sequence of N orthogonal non-stationary states of a 
quantum harmonic oscillator consists of wavefunctions 



ipn 



N-l 

=Y^ -t= e - 2 ™ mn / N \E n ), 

n=0 



N 



\E n ), m,n=0,l,...,N-l, (10) 



where \E n ) is the stationary state with energy E n = nAE 
and AE is the separation of the energy levels (the ground- 
state energy is taken to be zero). The average energy is 
E = (N — l)AE/2. 

Sequence (I10[) provides the maximum rate of computa- 
tion, given by 

27V E AE 

R = = N . 

N-lh h 

Assume now that, as a result of errors caused by noise, 
the resulting state is not pure, but has an entropy H. To 
calculate the part of the energy turned into heat, consider 
the state of thermal equilibrium with the same entropy. The 
energy corresponding to the thermal equilibrium of a har- 
monic oscillator at temperature T is 



E(T) 



AE 



e AE/kT _ l ' 



Hence, for a given maximum computation rate, the energy 
dissipation per computational step is 



E(T) = 



hR/N 



e hR/NkT _ i ' 



The error probability (the total probability to be in a state 
which is orthogonal to the correct one) is in this case 



_ g-AB/KT _ e -hR./NkT 



Hence, in terms of error probability, 

£ hR 



E(T) 



l-£ N 



Thus, the rate of energy dissipation (i.e., heat production 
per unit time) is 

£ hR 2 

(11) 



1 — £ N 



Through expressions © and (|lip. the above two exam- 
ples suggest a general conjecture — that for a fixed error 
probability (i.e., for a given intensity of noise effects) the 
energy dissipation rate increases quadratically with the rate 
of computation. 

In the next section we shall confirm this conjecture in the 
limiting case of a large number of degrees of freedom. 

3 General system with 

many degrees of freedom 

The exact form of the expression for heat production de- 
pends on the specific details of the computing system's 
energy-level structure. Nevertheless, under rather general 



assumptions, a closed- form result can be obtained for a 
broad class of systems. 

Following the analysis given in [3J, let's consider a system 
with many degrees of freedom that runs through a long se- 
quence (N 1) of mutually orthogonal states. It is shown 
in [5J Sect. 2.3] that all those states are superpositions of 
energy eigenstates with all different values of energy, 



AT-l 

E 

71=0 



\i>m) = y]c n e- 2 ™ E "' EN \E n ), m = 0,1,..., N-l, 
where 

/ E n+ i — E n 

and the average energy E is asymptotically (N ^> 1) 

E = {4> m \H\i> m )=E N /2. (12) 

Let T be the effective noise temperature of our system. 
The partition function Z and the average energy E of the 
thermal equilibrium state are 



n=0 



E n /kT 



oo 

E(T) = ±J2 E >MEn)e~ E "/ kT , 



(13) 



(14) 



where w(E n ) is the number of the microstates with energy 
E n . For a system with many degrees of freedom one has 
w(E n ) ~ E% (for n > 0), where a > 1. Typically[9], 
a» 1. Also, we assume that the zero-energy level is non- 
degenerate, i.e., w(E ) = 1. 

Let us introduce a dimensionlcss variable x = NE/En 
that characterizes the density of the energy levels: the num- 
ber of energy levels between x n and x n + Ax is equal to Ax. 
Also, suppose that w(E n ) = ax", where a is a dimensionless 
constant. Then expressions (fT3")) and (fT4"]) take the following 
form: 



Z = 1 + ax 

71=0 

1 °° F 



(E N /NkT)x„ 



-{E N /NkT)x„ 



71=0 



(15) 



(16) 



It follows from (g]) and ([12]) that, for N > 1, one has E N = 
hR. Replacing summation by integration in <| 1 5|) and (|16p . 
we obtain, using Euler's gamma function T, 

/>00 

Z = l+ ax a e-( hR ' NkT ^dx 
Jo 



l + aT(a+l) 



fNkT\ 



a+l 



E(T) = i f°° ax a+1 —e-^ NkT ^dx 



= |ar ( a + 2 )*r^i 



NkT\ a+1 



It will be convenient to express this result in terms of the 
error probability e, which is equal to the probability to be 
in a non-ground state, that is, 



1 



1 



(17) 



then 



E(T) = (a+ l)kT{\ ) = [a + X)kTe. 

z 

Expressing kT in terms of e from (|17p . we find that 



E(T) 



(a + l)e 



1-e ar(a+l) 



(18) 



Thus, the energy dissipation per computational step is pro- 
portional to the rate of computation, and so 



Q - hR 2 , 



(19) 



i.e., the rate of heat production is proportional to the square 
of the computation rate. 

Appendix: A generalized Clausius 
principle 

Theorem 1 Given two (in general, non-equilibrium) phys- 
ical systems with entropies Hi and H2, thermal equilibrium 
functions E\(T) and EziT), and effective temperatures T\ 
and T2, defined by equations 



Hi 



1 dEj(T) 
kT dt 



dt, (i=l,2) 



it is impossible to transfer energy from the first system to 
the second so as to decrease E\ (Ti) by an amount of energy 
AE and increase i^C^) by the same or a smaller amount 
as a sole result of a physical process. We assume that the 
functions E\ , E2 that characterize the systems remain un- 
changed as a result of the process. 

Proof. Suppose £^2(^2) has been decreased by AE and 
increased by AE' < AE without any other changes 
in the environment. As a result, the systems would now 
have new effective temperatures T[ < T\ and T' 2 > T2, and 
corresponding thermal equilibrium energies 

E 1 {T[)=E 1 {T 1 )-AE, 
E 2 (T' 2 ) ^E 2 (T 2 ) + AE / , 



where 
AE' 



T' 



dE 2 (T) 
dT 



dT < AE 



Ti 



dT 



dT 



(we assume that E\ and E 2 are monotonically increasing 
functions). Hence, the total change of entropy would be 



AH = AH l + AH 2 , 



where 



AHx 



t; 



dExjT) 
TdT 



dT, 



Using the mean value theorem, we get 

AH 1 = -^AE, T[<T*<Tx, (20) 

AH 2 = ^AE', T2<T;<Ti. (21) 

Obviously, <T 1 <T 2 < T 2 *. 

It follows from (|2T)]) , (l2"Tj) that the total change of entropy 
would be negative: 

I i t* _ J 1 * 
A# = AS + — AS' < -1 -AJ5 < 0. 

r* r* - t 1 *t 2 * 

Thus, such a process is impossible. ■ 

Note that the generalized Clausius principle does not pre- 
clude any exchange of work between the two systems, i.e., a 
process that does not affect the values of E\ (Ti) and E 2 (T 2 ). 

The significance of these results is that they represent a 
universal lower bound on the energy dissipation rate in a 
reversible computation process. 



[11] Toffoli, Tommaso, "Reversible computing," Seventh 
Colloquium on Automata, Languages and Programming, 
J. W. de Bakker and J. van Leeuwen, eds., Springer 
1980, 632-644. 
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